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1 Abstract 

Classical mean- variance portfolio theory^^ tells us how to construct a portfo- 
lio of assets which has the greatest expected return for a given level of return 
volatility. Utility theory then allows an investor to choose the point along 
this efficient frontier which optimally balances her desire for excess expected 
return against her reluctance to bear risk. The means and covariances of the 
distributions of future asset returns are assumed to be known, so the only 
source of uncertainty is the stochastic piece of the price evolution. 

In the real world, we have another source of uncertainty - we estimate but 
don't know with certainty the means and covariances of future asset returns. 
This note explains how to construct mean-variance optimal portfolios of as- 
sets whose future returns have uncertain means and covariances. The result 
is simple in form, intuitive, and can easily be incorporated in an optimizer. 

Various approaches already exist to improve portfolio construction in the 
presence of uncertain dynamics. Factor models^ and random matrix theory^^ 
can be used to provide de-noised covariance and correlation matrices as in- 
puts to optimizers, thereby ameliorating the effects of over-fitting. They do 
not, however, allow one to correct for the effects of uncertainty in expected re- 
turns. Re-sampled efficient frontiers^ provide a reasonable, simulation-based 
way to assess the stability of a given portfolio's performance to sampling 
uncertainty - and re-sampled efficiency is a reasonable, if ad hoc, metric to 
consider when constructing a set of portfolio weights. Other approaches exist 
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as well , and I won't attempt to enumerate or summarize the long list. But 
a tractable, closed-form, theoretically-grounded approach to incorporating 
the joint effects of uncertainty in expected returns and covariances is, to my 
knowledge, lacking. This is odd because, as we shall see, it's not that hard. . . 



2 A Brief Review 

We start by setting some notation and re-deriving the classical results: An 
investor will invest a fraction /o of her wealth in a riskless asset and 
fractions fifz^i-N} in each of risky assets. The risky assets' prices Pi are 
assumed to undergo known covarying diffusions 

dPi 

= fii{Pi) dt + cTi(Pi) dzi, {dzidzj) = pij dt 



dP 



so that 

dW ={hW)rdt + Y,{UW)'^ = W[rdt + Y,f,{p,-r) dt + f,a,dzi 

i=l ' i=l 

and the investor is assumed to choose her investment fractions so as to max- 
imize the expected change in her utilit}0, U, where 

Ut = UiWt) = -{Wt" - 1), X < 1. 

X 

Noting that U' = > and that U" = (x - 1) < 0, Ito's lemma 

gives us 

N ^ N 

{dUt) = Wt^[r + Y,fi if^i ~r) + ^ Yl P^^^ 

i=l *ii=l 

Thus we want to choose the fi to maximize a quantity 

N ^ N 

Q = /i (/^i - + Y ^' 

i=l i)i=l 



"'^We use a power-law utility function for convenience only. Other choices of utility 
function with positive first derivative and negative second derivative simply lead to a 
different risk-aversion constant in the expression for Q below. The analysis goes through 
without change. 
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Note that we expect an investment fraction, a dimensionless quantity, to be 
on the order of since this is the simplest dimensionless quantity we can 
form from the parameters that describe the dynamics (only excess return is 
relevant to investment in a risky asset). Indeed, in a world with only one risky 
security it is the case that the optimal investment fraction is exactly We 
will see that the optimization problem is greatly simplified by denominating 
our investment fraction in these units. . . We define 

so that 

1=1 * i,j=l * ^ 

N X 1 ^ 

= Cj Si + ^ Ci Cj Si SjPij 

i=l i,j=l 

where S denotes Sharpe Ratio. Defining the N x N matrix 

^ • ^ij — Pij 

and the x 1 vector 

A : A, = = ^ii 

we can write 

Q = c*A + ^^c*$c. 
2 

If all drifts and covariances are known with certainty, we can maximize Q 
over the q as per usual: 

= {dc,Q I c = c*) ^Ai + {x-l) {^c*)i, V i e [1, TV] 

which implies that 

c* = ^^$-^A 
1 — X 

or, equivalently, 

/• = ^[*-'A)l.f 
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which is the standard Markowitz result in our notation. 

It's worth noting what our choice of variables has gained us: The previous 
equation makes transparent the fact that, in order to maximize her (change 
in) utility, our classical mean-variance optimal investor should allocate her 
risk budget among risky assets "in proportion to" their Sharpe ratios. Specifi- 
cally, if she's choosing how to spread her capital among N uncorrelated assets 
then A = I and she'll invest a fraction fi* of her capital in the i-th risky 
asset so that the risk, /i*(Jj, is equal to Nothing new so far. . . 

3 Introducing Parameter Uncertainty 

But what if the drifts and covariances are themselves uncertain? Now our 
investor's future utility is uncertain not only because of the noise intrinsic 
to the risky assets, but also because her ability to characterize those risky 
assets is imperfect. This is a bad thing for her because d'^ {dUt) /dfidfj is 
negative (recall a; < 1), and therefore (dUt) declines as the uncertainty of her 
estimated fi increases - even if she has the right values on averageo We'll 
choose portfolio weights so as to maximize her expected future utility, where 
the expectation is taken over both the return uncertainty for a given process 
and the parameter uncertainty for that process. Let's define fii to be her 
estimate of /Xj, the true expected return for asset i. Similarly, let (jj, and 
Si be the estimated values of ai, pij and Si. Finally, we'll define 

- ■ f - - 



SO that 

_ fij-r _ ^ fii-r 

^Just as the positive convexity of an option's payout with respect to the price of the 
underlying asset gives rise to time- value if future prices are uncertain. . . Note that this 
same negative convexity justifies fractional Kelley strategies in the realm of proportional 
betting systems. 
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Note that the q and q simply represent the same real- world investment 
fractions /j in different units. This means that 

N 2 1 ^ 

. , C'j Z . O j (Jj 

1=1 2J = 1 

Ci {— ) \ — } ^ Ci Cj— Pij — ^ ^ 

.^^ CTi l^i-r 2 f-^^ Gi Gj Gi Gj Pij 

N ^ N 

i=i ij=i * ^^'^ 

It's important to note that any term in Q involving pi is zero when p^ — r, 
which implies that 

+00 r—£ 00 

dpP{p) g(/i) =lim( J + J )dpF{p) Q{p) 

—00 —00 r+£ 

This allows us to write 



where 



^ • Pij 

\Gi Gj pij I 

and * denotes element-by-element multiphcation (Matlab's .* operator or 
R/S- plus's * operator). Note that we don't need to address the presence of 
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the pole in {^j-zp^ because the evaluation of (Q) requires only ^ ^^4-^ ~ ^/ , 
Maximizing (Q) over the q gives 

1 — X 

or, equivalently, 

Plugging in these optimal investment fractions gives 

(Q) = ^ (A * Af (I. * B) (A * A) 

or 

{dUt)* = dt Wt^'ir+l (A * ly ($ * B) -1 (A * A)] . 

This formula for fi* is our main result. Note that when there's no 
uncertainty in expected returns or covariances then Ai = Bij = and 
we regain the standard answer. A bit more generally, if Ai = a,\/i and 
Bij = b,Wi,j, i.e. if relative uncertainties in drift and volatility are assumed 

— * 

to be the same across all securities, then c* is equal to | times the standard 
result - which just corresponds to a change of leverage. Since most real- world 
investors specify their risk-tolerance exogenously and use optimizers only to 
determine relative position sizes, the above prescription for incorporating 
parameter uncertainty has no real effect in this circumstance. In other words, 
the prescription is likely to be useful only when relative uncertainties in 
expected return and volatility differ across securities and/or uncertainties in 
correlations are introduced. 



4 Practical Application 

This is all fine and dandy, you say, but what should I actually do7 How 

— * 

can one quantify these A and B objects? Well, if an investor's estimation 
procedure is quantitative then the fitting procedure might return covariances 
for the estimated parameters that allow A and B to be computed directly. 
If not, then simple tools can allow an investor to quantify her degree of cer- 
tainty. In the following paragraphs I describe a few reasonable but ad hoc 
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parameterizations that allow closed form expressions for A and B (ignoring 
correlations among uncertainties). Plots of these functions with "sliders" for 
the input parameters can be built as part of an optimizer's GUI to enable 
the non-quantitative portfolio manager to include the uncertainty of her es- 
timates, as well as the estimates themselves, in the portfolio construction 
process. 

The remainder of this paper is organized as follows: First we propose simple 
and reasonable distributions that an investor can use to describe her uncer- 
tainty about expected returns, volatilities and correlations. Then we explain 

— * 

how to compute A and B using these distributions. Having set the stage, we 
conclude with a detailed, quantitative example. 

Estimate of < > f^^r • 

Example 1: Let's assume we have an unbiased estimate of the expected 
return of the i-th security fii — fj,i + Xi, Xi ~ N{0, Ej) so that (jii) — /li. This 
allows us to write 

A = (t^\ I/,~7V(0,^) 

Note that is iust the relative uncertainty in the estimate of the expected 
excess return of the i-th security. A simple exercise in Gaussian integral 
evaluation gives us: 



e^O" J 1+2/ ' M- 



oo —1+e 

0-2 2^ n! 2" (2n+l) 0-2" 
n=0 

This series converges quite quickly, even for unrealistically small values of cr, 
i.e. even when there's a high degree of certainty that jl is close to the true 
value of jl. The resulting function A{a) looks like this: 
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Sigma 



This shape is easy to understand quahtatively: If the uncertainty in i/i is 
zero then yi is equal to its mean value of zero and Ai = 1. For values of -j^^ 
small enough that there's negligible probability of |/i < — 1 then the positive 

convexity oi j^,,yi > -1 ensures that (^j;^^ > {j:^ = !)• But for very 

large values of -j^t;:, P{yi) becomes so broad that {j^^ ~ = by 

symmetry. 

Example 2: How should we compute if we're concerned that our fore- 
cast alphas are biased upward as a result of data mining? In this case, it's 
instructive and qualitatively reasonable to assume that fii — /li + Xi, where 

Xi ~ N(t^^^Y~, 3ipi-ir) y ^Q^Q |;]-^a^|; we've "deflated" our expected returns but 
we're still ascribing predictive power to our data-mined results since an ob- 
servation of /ij = r is deemed to be 2-sigma event. Note, too, that this 
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assumption fixes A: 
2 

= 0.85. 



Estimate of (?) : A convenient way to parameterize our estimate of the 

i-th security's return volatility is Ui = a^e^^Xi ~ A^(— S,) where I'm de- 
noting the uncertainty by for reasons of convenience and familiarity, but 
the S, in this discussion of is not related to the S; in the discussion of 
above. This parameterization ensures that {ai) = a;. P{ai < 0) = and 
allows us to compute l^) = e^'^ > 1 and (^) = e^^*^ Sj can be cho- 



sen to represent the uncertainty of the volatility forecast using the relation 
Ej^ = ln(l -|- ™^2''* ) or by fitting a stochastic volatility model. 

Estimate of | : Dropping subscripts, we start as before by writing p — p+x, 
where x & [—1 — p, 1 — p] has zero mean if p is assumed to be an unbiased 



estimator. This gives {^) — \ -nr ) ^ where y = - e 



,P/ ~ \i+2//' « ~ ^' " L IpI' ^IpI 

has zero mean. Note that the fact that y has zero mean but is distributee 

over a range symmetric about —1 implies that the pdf for y will have pos- 
itive skew. We can learn quite a bit about the form of ( 4 ) as a func- 



tion of p without doing any calculations: To begin with, we know that 
symmetry requires that (^) — >Oasp— >0 (assuming P{p) unimodal). 



We also expect that — > 1 as |p| — > 1 because P{p) — > S{p±l) as 
(p) — > t1- Finally, we see that > 1 for \p\ — 1 — £, |£| << 1 because 
(f ) ^ ( 1+^) ^ ~y + y^ + h-O-) ~ 1 + {y"^) > 1 when the support for y 
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is near zero, as it is when |p| = 1 — £. Thus, as a function of p should 
behave quahtatively as follows: 
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A useful form for the probability distribution of p that's simple to work with 
is: 

P(p) oc {l-e^'^'-^^Y{l±pY~'"^'^ 

The user needs to find two exponents that characterize her estimate, normal- 
ize the distribution, and use it to compute ( ^ ) • By doing so, one can easily 



verify that the preceding figure is indeed a reasonable guide to the value of 



4\ for a given value of (p). 
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5 A Detailed Example 



Consider a world with just two risky assets. The (annualized) parameters 
that will govern the time evolution of their prices are: 



Pl2 



1^2 
(72 
= 0. 



r + 10%, 
30%, 



These parameters are unobservable and can only be estimated. If an in- 
vestor did know the true parameters, symmetry implies that she'd choose 
/i = /2 and achieve a (leverage-independent) Sharpe ratio = (expected ex- 
cess return / return volatihty) = 10%/^^ ~ 0.47 at whatever leverage her 

utility function happened to dictate (the factor of \/2 is just the diversifi- 
cation effect of our assumption that the return processes are uncorrelated) . 
Any parameter uncertainty will reduce the Sharpe ratio to less than 0.47, 
because fii ^ /i2and/ora"i ^ ui will lead to a sub-optimal weighting in which 

/i ^ h. 



Of course, the parameters are not known and must be estimated. Let's 
assume that an investor's estimates will be unbiased and may be thought of 
as being drawn from the following sampling distributions: 

/ti = //I + zi, e A^(0, 5%), so that stdev(^) = ^ = 0.5 
/i2 = + Z2, Z2 e A^(0, 10%), so that stdev(g) = = 1 



= (Tie^i, xi e A^(-0.5%, 10%), so that stdev(f^ 



10% " 

~ 30% 
9% 



a2 = a2e''\ X2 G Ar(-4.5%, 30%), so that stdev(f|) = ^^^^ 
Pi2 = Pi2 = (to prevent overcomplicating the example) 

Recalling that 



0.1 and 
0.3 and 



£1 

171 

£2 
, 0-2 



g-l/(2x,2) oo^ 



Xi 



^ n! 2" (2n + 1) x^^" 



stdev(/ii) 
fit 



Bj, 



e^^'^and Bij^i^j) = e 



we see that our assumptions imply that 



A 



1.28 
0.73 



and B 



var^fjij 

1.03 1.11 
1.11 1.31 



- 1, 
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We now run the following experiment : 

Step 1: Initialize three accounts with $1 of capital, i.e. 1^1,2,3(0) = $1. 
Step 2: Draw fii, fi2, ai, from the above distributions. 
Step 3: Compute 

^[($*S)-HA*i')].|,and 

Step 4: Generate one-step returns r\^2 for securities 1 and 2 using their true 
parameters and apply them to the holdings: 

Wi^Wiexp(J]/,~r,), 

i 

W2^W2exp(J]/>^**-r,), 

i 

W3^W3exp(5;]//™V,). 

i 

Step 5: Goto Step 2. 

We repeated this loop 100,000 times, then computed the three portfolio re- 
turn series Rj(i) = \\^{-^^-^ and found: 

Sharpe ratio{ Ri(t) } = 0.27 
Sharpe ratio{ R2(t) } = 0.37 
Sharpe ratio{ Rajt) } = 0.46 

These are qualitatively as we expect: 

1. The true parameters gave the expected result of about 0.47, up to 
sampling error. 

2. The naive application of the standard mean- variance framework using 
the estimated parameters gives 0.27, a much worse result than the 
theoretically optimal 0.47 due to the negative convexity of the expected 
change in utility with respect to the investment fractions. 



f naive 

f better 

f true 

Ji ~ 
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3. The approach outhned in this paper gives 0.37, a result significantly 
better than 0.27 but still worse than 0.46. 

Finally, note that we've also run experiments in which we allowed the in- 
vestor's estimates of her uncertainty to themselves be wrong, i.e. we've run 
the above experiment with J^'"^**^'^' computed using noisy A and B (stemming 
from imperfect knowledge of how noisy arc the parameter estimates). The 
improvement in risk-adjusted performance is found to be very robust, i.e. 
substantial errors in A and B do not appreciably diminish the performance 
and Sharpe ratio{ R2{t) } was well above Sharpe ratio{ Ri{t) } in every ex- 
periment. 
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